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A method of training acoustic models of a segmentation-based automatic speech 
recognition system, comprising the steps of: 

receiving correct alignment data that represents a correct segment alignment of an 
utteran&e that was received by the speech recognition system; 

receiving wrong aUenment data that represents an alignment of the utterance that is 
known to be incorrect based on information received from the speech 
recognition system and describing the utterance; 

identifying a first phoneme m the wrong alignment data that corresponds to a second 
phoneme in the correct alignment data; 

modifying a first acoustic model orthe first phoneme by moving at least one mean 
value thereof further from the feature values used to score the first phoneme. 



12. A method as recited in Claim 1, further compnsing the steps of: 

2 receiving correct alignment4atajhat represents asp alignment of the utterance that is 

3 known to be correct based on information deceived from the speech 

4 recognition system and describing the utterance; 

5 identifying a second phoneme in the correct alignmen^data that corresponds to the 

6 first phoneme in the wrong alignment data; 

7 modifying a second acoustic model of the second phonemd\by moving at least one 

8 mean value thereof closer to the feature values used tovscore the second 

9 phoneme. 
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A method as recited in Claiiml, wherein receiving correct alignment data comprises 
the step of receiving correct-aliOTnient data that represents a segment ahgnment of a 
highest scoring hypothesized aligmnent selected from among n-best hypotheses of an 
utterance that was received by the spe^sch recognition system. 



A method as recited in Claim 1, wherein receiving wrong alignment data comprises 
the steps of receiving wrdngrahgnment data that represents an alignment of the 
utterance that is known to be incorrect based on user confirmation information 
received from the speech recognition system in response to prompting a speaker to 
confirm the utterance. 



A method as recited in Claim 1, wherein receiving correct alignment data comprises 
the steps of receiving correcVaHgiunent data that represents an alignment of the 
utterance that is known to be correct based on user confirmation information received 
from the speech recognition system in response to prompting a speaker to confirm the 
utterance. 

A method as recite^in ClaimJ^^^rther comprising the step of iteratively repeating the 
identifying and modifying steps for all phonemes in the wrong alignment data that 
correspond to one or more phonemes in the correct alignment data. 
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17. A method as^^e^ in Claim 2, further comprising the step of iteratively repeating the 

2 identifying and modifying steps for all phonemes in the correct alignment data that 

3 correspond to one omnore phmiemes in the wrong alignment data. 

1 8. A method as recited in Claim 1, wherein the step of moving at least one mean value 

2 further from a corresponding mean value of a second acoustic model of the second 

3 phoneme comprises subtractmg"the mean value of the third acoustic model from the 

4 mean value of the second acoumc model. 

19. A method as recited in Claim 1, wherein the step of moving at least one mean value 

2 further from a correspondingjnean^value of a second acoustic model of the second 

3 phoneme comprises reducing the meaia value of the third acoustic model by 

4 approximately two percent (2%). \ 

1 10. A method as recited in Claim 1, wherein modifying a first acoustic model further 

2 comprises the steps of modi^ongall acoustic models associated with the first 

3 phoneme by moving all mean values thereof further from corresponding mean values 

4 of all second acoustic models associated with the second phoneme. 

1 11. A method as recited in Claim 2, wherein modifying aVhird acoustic model further 

2 comprises the steps of modifying all acoustic models associated with the third 

3 phoneme by moving all mean values thereof closer to ccK^sponding mean values of 

4 all acoustic models associated with the second phoneme 
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1 12. A method of improVing performance of a segmentation-based automatic speech 

2 recognition system (ASR) by training its acoustic models using information obtained 

3 from a particular application in which the ASR is used, comprising the steps of: 

4 receiving a correct segment alignment of an utterance that was received by the ASR; 

5 receiving an alignment of tWe utterance that is known to be incorrect based on 

6 information received rtom the speech recognition system in the contextofthe 

7 particular application; \ 

8 identifying a first phoneme in the Known incorrect alignment that corresponds to a 

9 second phoneme in the correct segment alignment; 

10 modifying a first acoustic model of the first phoneme by moving at least one mean 

1 1 value thereof fiirther from a consesponding mean value of a second acoustic 

1 2 model of the second phoneme. \ 

1 13. A method as recited in Claim 12, fiirther comprising the steps of: 

2 receiving an alignment of the utterance that isWnown to be correct based on 

3 information received from the speech recognition system in the context of the 

4 particular appUcation; \ 

5 identifying a third phoneme in the known correct alignment that corresponds to the 

6 second phoneme in the correct alignment; \ 

7 modifying a third acoustic model of the third phoneme py moving at least one mean 

8 value thereof closer to the corresponding mean value of the second acoustic 

9 model of the second phoneme. \ 
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1 14. A computer-readable meaium carrying one or more sequences of instructions for 

2 training acoustic models of a segmentation-based automatic speech recognition 

3 system, wherein execution oVthe one or more sequences of instructions by one or 

4 more processors causes the one or more processors to perform the steps of: 

5 receiving correct alignment dara that represents a correct segment alignment of an 

6 utterance that was receivbd by the speech recognition system; ^ 

7 receiving wrong alignment data tmat represents an ahgnment of the utterance that is 

8 known to be incorrect basefl on information received from the speech 

9 recognition system and descidbing the utterance; 

10 identifying a first phoneme in the wrong alignment data that corresponds to a second 

1 1 phoneme in the correct alignment data; 

12 modifying a first acoustic model of the lirst phoneme by moving at least one mean 

13 value thereof fiirther from a corresponding mean value of a second acoustic 

14 model of the second phoneme. \ 

1 15. A computer-readable medium as recited in Claim 14, wherein the instructions further 

2 comprise instructions for carrying out the steps\of^~^ 

3 receiving an aUgnment of the utterance that is krrown to be correct based on 

4 information received from the speech recognition system in the context of the 

5 particular application; \ 

6 identifying a third phoneme in the known correct aUgnment that corresponds to the 

7 second phoneme in the correct alignment; \ 



47898-056 



-27- 



8 modifying a third acoustic model of the third phoneme by moving at least one mean 

9 value thereof clos^ to the corresponding mean value of the second acoustic 
10 model of the second mioneme. 

1 16. A segmentation-based automatic\speech recognition system that provides improved 

2 performance by training its acoustic models according to information about an 

3 application with which the system isVised, comprising: 

4 a recognizer that includes one or moreWocessors; 

5 non- volatile storage coupled to the recognizer and comprising a plurality of 

6 segmentation alignment data and plurality of acoustic models; 

7 a computer-readable medium coupled to thArecognizer and carrying one or more 

8 sequences of instructions for the training acoustic models, wherein execution 

9 of the one or more sequences of instructions by the one or more processors 

10 causes the one or more processors to peribrm the steps of: 

1 1 receiving correct alignment data that represents a correct segment alignment of 

12 an utterance that was received by the speech recognition system; 

1 3 receiving wrong alignment data that represents\an alignment of the utterance 

14 that is known to be incorrect based on information received from the 

15 speech recognition system and describing me utterance; 

16 identifying a first phoneme in the wrong aUgnmentuata that corresponds to a 

1 7 second phoneme in the correct alignment data; 
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18 modifying a first acoustic model of the first phoneme by moving at least one 

1 9 mean value thereof fiirther fi-om a corresponding mean value of a 

20 second acoustic model of the second phoneme. 

1 17. A speech recognition system as recited in Claim 16, wherein the instructions fiirther 

2 comprise instructions for carrying out the steps of: 

3 receiving an alignment of the utterance mat is known to be correct based on 

4 information received from the speech recognition system in the context of the 

5 particular application; \ 

6 identifying a third phoneme in the known coWect alignment that corresponds to the 

7 second phoneme in the correct alignmeait; 

8 modifying a third acoustic model of the third phoneme by moving at least one mean 

9 value thereof closer to the corresponding mean value of the second acoustic 
1===^ 10 model of the second phoneme. \ 
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